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(54) Method and apparatus for Image coding and decoding 



(57) The optimum predictive image is selected as fol- 
lows for the entire area of an image sequence comprising 
a luminance signal expressing each pixel value, and a 
transmittance signal expressing the transparency of 
each pixel value, and described by the transmittance sig- 
nal of the target image to be coded as the area to be 
coded. The target image to be coded is predicted from 
at least one template typical of the image sequence, 
thereby generating a first predictive image. The target 



image to be coded is also predicted from the image dis- 
played chronologically before the target image, thereby 
generating a second predictive image. It is then deter- 
mined whether the first or the second predictive image 
has the least difference to the target image to be coded, 
and the predictive image with the least difference is 
selected as the optimum predictive image. 
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Description 

BACKGROUND OF THE INVENTION 

1. Reld of the invention 

The present invention relates to a method and appa- 
ratus for predictive coding, storage, and transfer of digital 
images, and to an apparatus for signal decoding and 
image synthesizing, and relates particularly to a method 
and apparatus for predictive coding, storage, and trans- 
fer of digital images obtained by synthesizing plural 
object-based image layers, and to an apparatus for sig- 
nal decoding and resynthesizing said object-based 
images. 

2. Description of the prior art 

J. Wang and E. Adelson have proposed a method 
for decomposing moving images into object-based lay- 
ers (a different layer for each object in the image) for cod- 
ing as a means of efficiently transmitting and recording 
moving images. Their method is described in "Layered 
Representation of Image Sequence Coding," J. Wang 
and E. Adelson, Proc. IEEE Int. Conf. Acoustic Speech 
Signal Processing. 1993. pp. V221-V224: and in "Lay- 
ered Representation for Motion Analysis," J. Wang and 
E. Adelson, Proc. Computer Vision and Pattern Recog- 
nition, pp, 361 - 366, 1993. 

This method is described below, assuming a motion 
picture containing fish, seaweed, and background. After 
each of the three elements composing this motion pic- 
ture is separately recorded as a chromakey image, the 
elements can be synthesized into a single composite 
image, or a sequence of plural image frames bracketing 
a particular image can be analyzed and separated into 
the three component parts. The method proposed by 
Wang et a!., analyzes the motion picture sequence to 
extract the fish, seaweed, and background, and separate 
each of these objects to a discrete layer. Each layer is 
then separately conpression coded. The coded data is 
then multiplexed by a multiplexer, adding information 
identifying the vertical relationship between the layers 
(i.e., which layer overlays which layer), for storage or 
transmission. When it is also necessary to identify tine 
relative transparency of the pixels or shapes In each 
frame layer, a transmittance signal is also coded and 
transmitted or stored with the object layers. 

Plural decoders are similarly required on the repro- 
duction side. The multiplexed data is first demultiplexed 
Into the separate layers, and the data in the separate lay- 
ers is then simultaneously decoded by the respective 
decoders. The objects in the reproduced layers are then 
overlaid by the image synthesizer based on the vertical 
layer relationship data to generate the reproduction (syn- 
thesized) image then displayed on the display apparatus. 

To improve the compression efficiency, an image 
representative of each layer (e.g., the fish, seaweed, or 
background scene) is determined and used as a refer- 



ence image for that layer. Note tiiat the template for each 
layer is selected to be most representative of the object 
sequence in that layer. These templates may be selected 
manually or automatically, and Wang etal., also describe 

5 a method for automatically generating these templates. 
The defined templates are compression coded first, 
and each object in the layer is coded by predictive 
approximation based on the displacement or deforma- 
tion of the generated template. By separating the motion 

10 picture into component objects, tiiose objects can be 
more accurately approximated without being influenced 
by other nearby objects. Objects at a sufficient distance 
from tfie camera can also be treated as still objects, mak- 
ing it possible to describe the deformation and dispiace- 

15 ment (change and movement) in such objects with few 
parameters. More specifically, an affine transformation 
is used to approximate a single object using six param- 
eters, tiiereby requiring few bits and achieving an 
extremely high compression rate. 

20 The method approximating objects using templates 
as described above, however, results in increased error 
when there is a large change in object shape or lumi- 
nance, and degrades the compression coding efficiency. 
In addition to approximation using templates, it is there- 

25 fore necessary to predict such changes using the images 
displayed chronologically before and after tiie image to 
be coded, and to adaptively select the optimum predic- 
tive image. 

Discrete predictive coding of each object also com- 

30 monly results in a mismatch between the contours of the 
coded object and tiie predictive object. Contour mis- 
matches increase tiie difference value, and prevent tiie 
efficient coding of the contour luminance and color dif- 
ference signals. 

35 Furthermore, tiie reproduction side must have three 
decoders to decode tiie objects in tiiree layers as 
described above. As a result, the number of reprodudble 
layers is limited by the number of decoders available to 
the reproduction side. A frame memory with sufficient 

40 capacity to store the decoder output is also needed to 
synthesize the reproduced objects in each layer, and tiie 
number of frame memory units is proportional to the 
number of layers. As the number of image layers 
increases, the overall size and cost of the decoder 

45 increase greatly. 

Synthesis of the output image according to the ver- 
tical relationship data of the layers also prevents selec- 
tive display of the layers, and prevents a selected layer 
from being moved from the coded position to a position 

so behind or in front of anotiier layer. More specif icaiiy. inter- 
activity is impaired. 

SUMMARY OF THE INVEN-nON 

55 Therefore, an object of the present invention is to 
prevent deterioration of the predictive image as a result 
of large changes in the shape or luminance of an object 
in a predictive coding method using templates, and to 
reduce the accumulation of prediction error over time. 
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To achieve this, the present invention converts the 
image to be coded using a particular transformation 
method to generate predictive images from at least one 
template and the image displayed chronologically before 
or after the image to be coded, and uses the predictive 5 
image with the least difference to the image to be coded 
as the optimum predictive image for that image. 

In addition, a new predictive image generated by 
averaging plural predictive images is added to the can- 
didate predictive images from which the predictive image 10 
with the least difference is selected as the optimum pre- 
dictive image for the image to be coded. 

Furthermore, the optimum predictive image is 
divided into plural predictive subareas, that is, the image 
to be coded is divided into plural target subareas. For is 
each target subarea in which there is at least one pixel 
value that should not be coded and the corresponding 
predictive subarea, the pixel values to be coded in the 
corresponding predictive subarea are then operated on 
using a known function to calculate a substitute pixel 20 
value, and this substitute pixel value is then substituted 
into the target subarea and corresponding predictive 
subarea for the pixel values therein that should not be 
coded. The difference signal is then obtained from the 
target and predictive subareas containing the substitute 2s 
pixel value. 

The second object of the present invention is to sup- 
press an increase in the difference value caused by a 
mismatch between the contours of the target object to 
be coded and the predictive object. 30 

To achieve this object, the predictive image and the 
target image to be coded are divided into plural predictive 
subareas and target subareas, respectively. Before the 
difference between corresponding predictive and target 
subareas is obtained, the pixel values to be coded in the 3S 
predictive subarea are operated on using a known func- 
tion to calculate a substitute pixel value for the corre- 
sponding target subarea in which there is at least one 
pixel value that should not be coded. This calculated sub- 
stitute pixel value is then substituted into the target sub- 40 
area and tiie corresponding predictive subarea for each 
pixel value therein not to be coded. The difference 
between the target subarea and predictive subarea is 
then calculated after making this pixel value substitution. 

The third object of the present invention is to achieve 4S 
a system whereby an image can be reproduced by 
means of a single decoder irrespective of tfie number of 
object layers composing the image, the image can be 
syrrttiesized using a frame memory that is not dependent 
upon the number of object layers, and layers can be so 
selectively reproduced, thus enabling high interactivity 
with the user. 

The present invention uses an apparatus for decod- 
ing and synthesizing digital images composed of plural 
superimposed image layers where said digital images ss 
are coded by separately compression coding each of the 
plural image layers, and then multiplexing the layers in a 
predetermined order. Preferably, this coded data is mul- 



tiplexed in sequence from either the background layer or 
the foreground layer. 

Said decoding and synthesizing apparatus accord- 
ing to the present invention comprises an external line 
input terminal, decoder, synthesizer, frame memory, and 
output means. The coded data is input to tiie external 
line input terminal, and each layer is tiien decoded to a 
reconstructed image in the sequence in which the coded 
data was multiplexed. This reconstructed image and the 
synthesized image supplied from tiie frame memory are 
input to the synthesizer, which generates a new synthe- 
sized image by merging the synthesized image and tiie 
reconstructed image. This new synthesized image is 
stored to the frame memory, and displayed by the output 
means. 

A first selector switch is also disposed between the 
external line input terminal and the decoder of the above 
decoding and synthesizing apparatus, and is controlled 
to not connect the external lane input terminal and 
decoder when the image of a layer that is not reproduced 
is input. In addition to the first selector switch, a second 
selector switch is disposed between the synthesizer and 
frame memory, and is controlled to not connect tiie syn- 
thesizer and frame memory when the image stored in 
the frame memory is not to be updated. 

BRIEF DESC RIPTION OF THE DRAWINGS 

The present invention will become more fully under- 
stood from tiie detailed description given below and the 
accompanying diagrams wherein: 

Rg. 1 is a block diagram of an image predictive cod- 
ing apparatus according to a first embodiment of the 
present invention; 

Rg. 2 is a diagram of a pictorial image showing an 
image synthesized by overlaying plural image lay- 
ers; 

Rg. 3 is a block diagram of a predictor shown in Rg. 
1; 

Rg. 4 is a view similar to Rg. 3, but showing an alter- 
native embodiment of the predictor; 
Rg. 5 is a diagram showing a pictorial image of the 
predictive method of the present invention; 
Rg. 6 is a block diagram of an image predictive cod- 
ing apparatus according to a second embodiment of 
the present invention; 

Rgs. 7A, 7B and 7C are diagrams showing the data 
structure generated by a predictive coding method 
using plural templates; 

Rg. 8 is a block diagram of an image predictive cod- 
ing apparatus according to a third embodiment of the 
present invention; 

Rgs. 9A, 9B and 9C are diagrams showing the effect 
of difference value reduction by pixel value substitu- 
tion according to the third embodiment of the present 
invention; 

Rg. 10 is a blockdiagram of a preferred embodiment 
of a decoder according to the present invention; 
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Fig. 1 1 is a blockdiagram of a digital image decoding 
and synthesizing apparatus according to a first 
embodiment of the present invention; 
Figs. 12A, 12B and 12C are diagrams showing the 
data structure generated by compression coding s 
plural image layers, and multiplexing the data; 
Fig. 1 3 is a blockdiagram of a digital imagedecoding 
and synthesizing apparatus according to a second 
embodiment of the present invention; 
Fig. 1 4 is a blockdiagram of a digital image decoding io 
and synthesizing apparatus according to a third 
embodiment of the present invention; 
Fig. 1 5 is a block diagram of a digital image decoding 
and synthesizing apparatus according to a fourth 
embodiment of the present invention; is 
Fig. 1 6 is a blockdiagram of a digital image decoding 
and synthesizing apparatus according to a fifth 
embodiment of the present invention; 
Fig. 1 7A is a block diagram of a digital image decod- 
ing and synthesizing apparatus according to a sixth 20 
embodiment of the present invention; and 
Fig. 1 7B is a diagram showing the steps for synthe- 
sizing the layers. 

Fig. 18 shows a table representing the operation of 
the digital image decoding and synthesizing appa- 2s 
ratus of Fig. 14; and 

Fig. 19 shows a table representing the operation of 
the digital image decoding and synthesizing appa- 
ratus of Fig. 15. 

30 

DESCRIPTION OF PREFERR ED EMBODIMENTS 

The input image processed by the present invention 
comprises objects in plural layers where the objects in 
each layer are defined by a luminance signal expressing as 
the pixel value, and a transmittance signal expressing the 
transparency state of each pixel value. This is also the 
case when a color difference signal is also present in 
addition to the luminance signal; note that discussion of 
the color difference signal is omitted in the following 40 
description. Furthermore, unless othenivise stated, 
image objects as referred to herein are composed of a 
luminance signal and transmittance signal, and the pixel 
values used herein refer to the value of the luminance 
signal. 45 

Fig. 2 shows the luminance signal of one image D 
1003 in a motion picture of plural frames, and tiie lumi- 
nance signals of the three components combined to form 
the luminance signal of image D 1003. The three com- 
ponent images of image D 1003 (fish A 1000, seaweed so 
B 1001, and background C 1002) can be separately 
recorded as chromakey images, and then combined to 
form image D 1003; or image D 1003 and an image 
sequence of plural frames before and after image D 1 003 
can be analyzed to separate image D 1003 into the three 55 
image components. These three image components are 
referred to as "layers" herein. Thus, the image sequence 
containing image D 1003 has three layers, the layer of 
fish A 1000, the layer of seaweed B 1001 , and the back- 



ground C 1002 layer. The number of layers will depend 
upon the image. 

A ti-ansmittance signal indicating the ratio at which 
the pixels should be overlaid is added to the objects in 
each layer. When tiie transmittance is 100%. an opaque 
object is present and the background cannot be seen; at 
0% transmittance, there is no object present and tiie 
background can be viewed with nothing in front. Trans- 
mittance levels between 0% and 1 00% occur when tiiere 
is a semitransparent object present, such as glass, and 
at object boundaries. The transmittance signals for fish 
A 1000 and seaweed B 1001 in Fig. 2 are shown as 
images A' and B*. respectively. It should be noted that 
this transmittance signal describes the shape of the 
object. 

As shown in Fig. 1 , luminance signal 1 000 and trans- 
mittance signal 1004 are input from the input terminal 
346, and stored to the input frame memory 330. The 
luminance signal and transmittance signal from tiie input 
frame memory 330 are input to tiie predictor 344 simul- 
taneously with the image stored in the first frame mem- 
ory 342 and the image stored in the second frame 
memory 340 through line 504. (An image divider 331 is 
shown for the purpose of modification, and is not pro- 
vided in this embodiment.) A template previously coded 
and reproduced is stored to a first frame memory 342, 
and an image (i.e.. not a tenplate) previously coded and 
reproduced is stored to a second frame memory 340. 
The predictor 344 generates tiie optimum predictive 
image from the images input thereto. Adder 332 then 
obtains the difference image from the input image stored 
in the input frame memory 330, and the predictive image 
output from the predictor 344; tiie resulting difference 
image is then coded by the coder 334. 

The coder 334 then divides tiie difference image into 
blocks using an image divider 359. It is to be noted tiiat 
while blocks of 16 x 16 pixels or 8 x 8 pixels are prefer- 
able, the image divider 359 can divide one frame image 
into blocks of any particular shape and size. Each block 
is then converted ty a discrete cosine transformation 
(DOT) operator 351 or other ortiiogonal transformation 
operation, and quantized by a quantizer 353. The quan- 
tized image is then output from the output terminal 348, 
and inverse quantized by an inverse quantizer 355 and 
inverse OCT converted by an inverse OCT 357 in a local 
decoder 336. 

The reverted difference image is then added to the 
predictive image by an adder 338 to generate the recon- 
structed image stored to a second frame memory 340. 
It should be noted that both the luminance signal and tiie 
transmittance signal are coded. Furthermore, while the 
transmittance signal is DOT converted identically to the 
luminance signal in this example, a different compres- 
sion method may be used. The coder 334 may also apply 
wavelet conversion or vector quantization in place of 
DOT conversion insofar as the reversion process exe- 
cuted by the local decoder 336 corresponds to the con- 
version process used by the coder 334. 
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The image used as the template is the image most 
closely resembling all images in the continuous image 
sequence, and can be selected either manually or auto- 
matically. Plural templates may also be selected for a sin- 
gle image sequence. Each template is also expressed 5 
by a luminance signal and transmittance signal, com- 
pression coded before transmission, and the reproduced 
template is then stored to the first frame memory 342. 
The reproduced template stored to the first frame mem- 
ory 342 is also updated at either regular or irregular inter- 
vals. It is also noted that the template can be a conplete 
image, or may comprise only a part of the complete 
image (e.g., an ear, eye, or mouth). It is also not neces- 
sary to transmit the template, in which case an image 
previously stored in the transmitter and receiver may be 
used as the template. 

The operation of the predictor 344 is described next 
with reference to Figs. 3 and 5 for the case in which frame 
2014 in Fig. 5 is predicted. Fig. 5 shows a simulation of 
the image predictive coding method. Note that frame 
2014 is the target image, i.e.. the image frame to be 
coded, frame 2010 is the tennplate, and frame 2012 is 
the image frame displayed chronologically before the tar- 
get image 2014. Each of these images 2010. 2012. and 
2014 is input to the predictor 344 shown in Fig. 3. spe- 
crfically inputting target image 2014. i.e.. the luminance 
signal and transmittance signal thereof, to terminal 372. 
the template 2010 to terminal 368, and the "previous" 
frame 2012 to terminal 370. Note that the "previous" 
frame 2012 is the decompressed, reconstructed image 
stored to the second frame memory 340 in Fig. 1, and 
the template 2010 is the similarly decompressed, recon- 
structed image stored to the first frame memory 342 in 
Fig.1. 

The target image 2014 and template 2010 are then 
input to a deformation and displacement calculator 350, 
which obtains the deformation and displacement param- 
eters, such as shift amount data, rotation degree data, 
contraction data, expansion data, curve data, etc. More 
specif icaliy. in the deformation and displacement calcu- 
lator 350. by comparing the template 2010 and the target 
image 2014. various parameters are produced that rep- 
resent amount of change from the template 2010 to the 
target image 201 4. For example, if the target image 201 4 
is shifted by a vector A and rotated by a degree 6 from 
the template 2010, parameters A and 6 are produced 
from the deformation and displacement calculator 350. 
An aff ine transformation is preferably used in the defor- 
mation and displacement calculator 350 to obtain the 
deformation and displacement parameters, but a trans- so 
formation containing a term of the second degree may 
be alternatively used. The parameters, i.e., affine coeffi- 
cients, obtained by the deformation arKi displacement 
calculator 350 are input together with the template 201 0 
to a predictive image generator 352. 55 

In the predictive image generator 352, a quasi target 
image (this may not be exactly the same as the target 
image 2014 applied to the terminal 372) is reformed 
using the template 201 0 and the parameters. The quasi 



target image, which is referred to as a first predictive 
image (luminance and transmittance signals), is gener- 
ated by means of an affine transformation. The structure 
and operation of the deformation and displacement cal- 
culator 350 and predictive image generator 352 are 
described in: "An Image Coding Scheme Using Layered 
Representation and Multiple Templates," M. Etoh, et al., 
Technical report of lEICE. IE94-159, PRU94-159, 1995; 
"Layered Representation of Image Sequence Coding." 
J. Wang and E. Adelson. Proc. IEEE Int. Conf. Acoustic 
Speech Signal Processing, 1993, pp. V221-V224; and in 
"Layered Representation for Motion Analysis," J. Wang 
and E. Adelson, Proc. Computer Vision and Pattern Rec- 
ognition, pp. 361 - 366, 1993, which are incorporated 
herein by reference. 

This first predictive image and the target image are 
then input to the difference calculator 354, which obtains 
a first square difference sum by the following equation 
(1): 

image 

in which Ty is a pixel value at position (i.D in the target 
image 2014, Qy is a pixel value at position (i.j) in the first 
predictive image (quasi target image), and "image" is a 
portion of the frame where the pixel values are to be 
coded or can be a complete one frame. The calculated 
sum is applied to the comparator 366. 

The affine coefficient of the target image 20 1 4 based 
on the "previous" frame 201 2 is likewise obtained by the 
corresponding deformation and displacement calculator 
356. and a second predictive image (quasi target image) 
is generated based on the "previous" frame 2012 by the 
corresponding predictive image generator 358. A differ- 
ence calculator 360 calculates the difference between 
the target image 2014 and the second predictive image 
from the predictive image generator 358 and produces 
a second square difference sum using equation (1). The 
difference sum is output to the comparator 366. 

The first and second predictive images are also input 
to an average calculator 362 to calculate a weighted 
averages of the respective luminance and transmittance 
signals between the first and second predictive images. 
Thus, the average calculator generates an average or a 
third predictive image. The difference calculator 364 then 
calculates a third square difference sum from this third 
predictive image and the target image, and outputs the 
result to the comparator 366. 

The comparator 366 thus compares the first, second 
and third square difference sums, detects the lowest 
sum, and controls the switches 376 and 376' to output 
the predictive image and its affine coefficients corre- 
sponding to the lowest difference value. More specifi- 
cally, if the difference between the first predictive image 
and the target image is smallest, i.e., Is smaller than tiiat 
between the second predictive image and the target 
image or smaller than that between the third predictive 
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image and the target image, the switch 376 is connected 
to pole 382 to output the first predictive image from ter- 
minal 374 and the switch 376* is connected to pole 382' 
to output the aff ine coefficients pertaining to the first pre- 
dictive image, that is, switches 376 and 376' produce a 
first data. The smallest difference is selected because 
the predicted image with the smallest difference is clos- 
est to the target image 2014. 

Similarly, if the difference between the second pre- 
dictive image from the image generator 358 and the tar- 
get image is detected to be the smallest, the switch 376 
is connected to pole 380 to output the second predictive 
image from terminal 374 and the switch 376' is con- 
nected to pole 380' to output the affine coefficients per- 
taining to the second predictive image, that is, switches 
376 and 376' produce a second data. If the difference 
between the third predictive image from the average cal- 
culator 362 and the target image is detected to be the 
smallest, the switch 376 is connected to pole 378 to out- 
put the third predictive image from terminal 374 and the 
switch 376' is connected to both poles 380' and 382' to 
output the affine coefficients pertaining to the first and 
second predictive images, that is, switches 376 and 376' 
produce a third or an average data. 

It is to be noted that while the difference calculators 
above obtain the sums of square difference values given 
by equation (1), it is also possible to obtain the sum of 
absolute difference values as given by the following 
equation (2). 

image 

It is also noted that average calculator 362 and 
the difference calculator 364 can be omitted to simplify 
the arrangement 

A further case in which frame 2018 in Fig. 5 is pre- 
dicted is described next In this exannple, frame 2018 is 
the target image, i.e., the image frame to be coded, frame 
2010 Is the template, and frame 2020 Is the Image frame 
displayed chronologically after tiie target image 2018. 

Each of these images 2010, 2018, and 2020 is input 
to the predictor 344 shown in Rg. 3. specifically inputting 
target image 2018, i.e., the luminance signal and trans- 
mittance signal thereof, to terminal 372. the template 
2010 to terminal 368, and the "future" or "next" frame 
2020 to terminal 370. Note that the "next" frame 2020 is 
the decompressed, reconstructed image stored to the 
second frame memory 340 in Fig. 1 . 

The first predictive image (luminance and transmit- 
tance signals) is generated by the deformation and dis- 
placement calculator 350 and predictive image 
generator 352 using the target and templates as 
described above. This first predictive image and the tar- 
get image are tiien input to the difference calculator 354, 
which obtains a first square difference sum from only the 
pixel values to be coded, arKi outputs to the comparator 
366. 



The second predictive image is likewise generated 
by the deformation and displacement calculator 356 and 
predictive image generator 358 using tiie target Image 
and the "next" frame 2020. The second square difference 

5 sum is obtained by the difference calculator 360. and the 
result is output to the comparator 366. 

The first and second predictive images are also input 
to the average calculator 362 to calculate the weighted 
averages of the respective luminance and transmittance 

10 signals and generate a third predictive image. The differ- 
ence calculator 364 then obtains a third square differ- 
ence sum from this third predictive image and the target 
image, and outputs to the comparator 366. 

The comparator 366 thus compares tiiese first, sec- 

15 ond, and third sums of square difference values, and 
controls the switches 376 and 376' to output the predic- 
tive image con-esponding to the lowest difference value 
as described above. 

A further case in which frame 2022 in Fig. 5 is pre- 

20 dieted or coded is described next. In tiiis case, a predictor 
344' shown in Fig. 4 is used. When compared with the 
predictor 344 of Fig. 3, the predictor 344' further has 
another set of a deformation and displacement calculator 
386, a predictive image generator 390 and difference cal- 

25 culator 392. Also, the average calculator 362' is provided 
to calculate a weighted averages of the respective lumi- 
nance and transmittance signals between the predictive 
images from the predictive image generators 390 and 
358. It is noted that the average calculator can be so con- 

30 nected as to calculate a weighted averages of the 
respective luminance and transmittance signals 
between any two predictive images or among the three 
predictive images from the predictive image generators 
352, 390 and 358. When tiie comparator 366 selects the 

35 difference from the difference calculator 364 as the 
smallestdifference. switch 376' is so connected as to out- 
put affine coeff icients pertaining to the predictive images 
applied to the average calculator 362. 

For coding the frame 2022 shown in Rg. 5, the frame 

40 2022 is the target image, i.e., the image frame to be 
coded, frame 2010 is the template, frame 2020 is the 
image frame displayed chronologically before the target 
image 2022, and frame 2024 is tiie image frame dis- 
played chronologically after the target image 2022. 

45 Each of these images 2010, 2020, 2022, and 2024 
is input to the predictor 344 shown in Fig. 4, specifically 
inputting target image 2022, i.e., the luminance signal 
and transmittance signal thereof, to terminal 372, tiie 
template 20 1 0 to terminal 368, tiie "previous" frame 2020 

50 to terminal 370, and tiie "next" frame 2024 to terminal 
386. Note that the "previous" and "next" frames 2020 and 
2024 are the r^roduced images of previously compres- 
sion coded image data. 

The first predicta've image (luminance and transmit- 

55 tance signals) is generated by the deformation and dis- 
placement calculator 350 and predictive image 
generator 352 from the target and tenplates as 
described above. This first predictive image and the tar- 
get image are then input to the difference calculator 354, 
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which obtains a first square difference sum using only 
the pixel values to be codedp and outputs to the conipa- 
rator 366. 

The second predictive Image is likewise generated 
from the "previous" frame 2020 by the deformation and s 
displacement calculator 356 and predictive image gen- 
erator 358, the second square difference sum is obtained 
by the difference calculator 360, and the result is output 
to the comparator 366. 

A third predictive image is similarly generated from io 
the target image and the "next" frame 2024 by the defor- 
mation and displacement calculator 388 and predictive 
image generator 390, a third square difference sum is 
obtained by the difference calculator 392, and the result 
is output to the comparator 366. is 

The second and third predictive images are also 
input to the averaging means 362 to obtain the weighted 
averages of the respective luminance and transmittance 
signals, and generate a fourth predictive image. The dif- 
ference calculator 364 then obtains a fourth square dif- 20 
ference sum from this fourth predictive image and the 
target image, and outputs to the comparator 366. 

The comparator 366 thus compares these first, sec- 
ond, third, and fourth sums of square difference values, 
and controls the switches 376 and 376' to output the pre- 25 
dictive image corresponding to the lowest difference 
value. 

Referring to Fig. 6, an image predictive coding appa- 
ratus according to a second embodiment of the present 
invention is shown. As shown in Fig. 6. image dividers 30 
335 and 337 are disposed after the frame memory 330 
and predictor 344, respectively, and before the adder 332 
to divide the target and predictive images into image 
blocks. The target image blocks and the predictive image 
blocks are input to the adder 332 from input lines 514 35 
and 515, respectively, and the adder 332 obtains the dif- 
ference signal from corresponding target and predictive 
image blocks. Note that coder 334 in this embodiment 
has no image divider 359. 

In the embodiments described above, the predictive 4o 
image is adaptively selected and coded for the entire 
area of the input (target) image. Rather than selecting 
the predictive image for the entire image, however, it is 
also possible to divide the image area to be coded into 
plural subareas (blocks), and adaptively select and code 45 
an optimum predictive block for each image block The 
preferable block size is 1 6 x 1 6 pixels or 8 x 8 pixels, but 
the image can be divided into blocks of any particular 
shape and size. 

The basic coding apparatus is the same as shown so 
in Fig. 1, except that an image divider 331 shown by a 
dotted line in Fig. 1 is further provided before terminal 
372. and the input image is separated into blocks before 
being input for coding. The predictor 344 as shown in 
Fig. 3 or Fig. 4 is used, except that in this case the affine ss 
coefficient must be obtained not for a frame but only a 
portion of a frame. Thus, the images produced from said 
first and second frame memories 342 and 340 can be a 
block size equal to the block produced from said image 



divider 331 . Thus, the size of each of the first and second 
frame memories can be smaller than one frame. Note 
that rotation and other conversion processes may be 
replaced by a simple parallel motion detection and com- 
pensation process. Block unit motion detection and com- 
pensation processes are described in U.S. Patent Nos. 
5,193,004 and 5,157,742 which are incorporated herein 
by reference. The present Invention differs from these 
two references in that the predictive signal is generated 
by referring to a template. 

Note that the second embodiment is the same as 
the first embodiment in all respects except that predictive 
Image generation and processing is executed on an 
image block basis. 

Rgs. 7A and 7B show the coded data structure mul- 
t'plexing data generated by a predictive coding method 
using plural templates. Fig. 7A shows a sequence of 
Images G1 to Q8 of a fish swimming. Images G1 to G8 
in this sequence are predicted using two templates T1 
andT2. 

The first step is to reproduce the templates by one 
of the methods of the compression code described 
above. Each of the images G1 to G8 is then generated 
by predictive approximation based on the deformation 
and displacement of the generated template, preferably 
using an affine transformation. The result is the coded 
data for templates T1 and T2, and each of the sequence 
images G1 to G8. A specific example of the coded data 
is shown in Fig. 128. As shown in Fig. 12B and will be 
described below, each block contains coded motion 
information 1016. quantization amount 1017, and DCT 
coefficient 1018. 

Rg. 7B illustrates the transfer method for the coded 
representative-image data of the present invention. As 
shown in Fig. 7B, the coded data for sequence images 
G1 to Q8 (1054 - 1061 in Fig. 7B) is transferred after 
transferring all template data 1051, 1052. and 1053. 
Transferring all template data first offers the benefit of 
being able to reproduce the Image sequence starting at 
any point in the sequence. Because the templates T1 
and T2 are received and reproduced first by the receiver, 
reproduction of the image sequence can begin without 
delay from any frame G1 to G8, thus Improving the use- 
fulness of special reproduction modes (such as fast for- 
ward and reverse). 

Rg. 7C shows an alternative transfer method for the 
coded template data according to the present invention. 
As shown in this example, the template data is trans- 
ferred immediately preceding the image frames for which 
the template data is referenced for reproduction. Thus, 
because template T1 is referenced to reproduce images 
G1 , G2. and G3, but template T2 is not. the coded data 
1062 for template T1 is transferred first, followed imme- 
diately by the coded data (1063, 1064, and 1065) for 
images G1 , G2. and G3. which is followed by the coded 
data 1066 for template T2. This means that the coded 
template data is multiplexed to the data stream so that 
coded data for unreferenced templates is not transferred 
before the motion picture data. This transfer method 
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makes it possible to reduce the delay time until the first 
image G1 can be reproduced, and reduces the frame 
memory requirements because it is not necessary to 
store all of the reproduced templates. 

Referring to Fig. 8, an image predictive coding appa- s 
ratus according to a third embodiment of the present 
invention Is shown. A numeric example of the Image pre- 
diction method used In the apparatus of Fig. 8 Is shown 
in Fig. 9. 

The image predictive coding apparatus shown in io 
Fig. 8 is essentially identical to that shown in Fig. 6. What 
differs is on the Input side of the adder 332. More spe- 
cifically, a pixel value substitution operation (pad opera- 
tion) is accomplished before the difference value is 
obtained by the adder 332. is 

The output (luminance and transmrttance signals) 
from the input frame memory 330 is first separated into 
plural blocks by image divider 341 . The preferable size 
of these image blocks Is 4 x 4 pixels, but the invention 
shall not be so limited. The output (luminance and trans- 20 
mittance signals) from the predictor 344 is likewise 
divided into plural blocks of 4 x 4 pixels by image divider 
335. The output of the image divider 335 thus comprises 
the luminance signal blocks and the transmittance signal 
blocks, which are input to a padding value generator 2s 
(substitute pixel value generator) 329. 

The padding value generator 329 calculates the 
substitute pixel values by applying a known function to 
the pixels of greater than 0% transmittance (i.e., the pix- 
els to be coded, corresponding to tiie black parts of trans- 30 
mittance signal 1000). The average of ail pixels of less 
than 100% transmittance is preferably used, but the 
invention shall not be so limited. 

The substitute pixel values are then input to pads 
(pixel value substitution means) 339 and 343 whereby 35 
the substitute pixel values are substituted for the pixel 
values (of the luminance signal) of 0% transmittance. 
The difference signal of the luminance signal blocks out- 
put from the pads 339 and 343 is tiien obtained by the 
adder 332. Note that the difference signal of the trans- 4o 
mittance signal block values is obtained without making 
any substitution. 

Figs. 9A, 9B and 9C show a numeric example of the 
above operation. Fig. 9A shows the transmittance signal 
blocks. Block 21 10 is output by the image divider 341 4S 
downstream from the input frame memory 330, and 
block 2112 is output by the image divider 335 down- 
stream from tiie predictor 344. Pixels shown with a value 
of 0 (zero) in blocks 21 10 and 21 12 indicate the pixel 
values not to be coded (having 1 00% transmittance). To so 
retain the transmittance information, the difference 
between blocks 2110 and 2112 is obtained directly, 
resulting in block 2114 output to the coder 334. 

Rg. 98 shows the luminance signal blocks. Block 
21 16 is output from image divider 341 , and block 21 18 ss 
is output by image divider 335. The difference block 
obtained directiy from these blocks is block 2120. The 
large difference values shown in block 21 20 are a result 
of a mismatch between the contours of the pixels to be 
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coded. Blocks 21 12 and 21 1 8 are input to the padding 
value generator 329 to obtain the average of tiie pixel 
values in block 21 1 8 corresponding to the pixels in block 
2112 having a value of zero. In this example, the average 
pixel value (substitute pixel value) is 49. Alternatively, 
blocks 2110 and 21 1 8 are input to padding value gener- 
ator 329. 

The substitute pixel value 49 is then input to the pads 
339 and 343, obtaining blocks 2122 and 2124 by substi- 
tuting this substitute pixel value of 49 for the pixels having 
a transmittance signal value of 1. Block 2126 is then 
obtained by calculating the difference of blocks 2122 and 
2124. Note that the difference value of block 2126 is 
clearly smaller than tiiat of block 21 20. By inputting block 
2126 in place of block2120 to the coder 334. coding can 
be accomplished using fewer bits. 

Note that the substitute pixel values are obtained 
from the prediction blocks, and it is therefore not neces- 
sary to transfer the substitute pixel values (because the 
substitute pixel values can be recalculated on the receiv- 
ing side). Furthermore, values obtained by applying a dif- 
ferent known function, or a predetermined known value, 
may be substituted rather than using the average value 
described above insofar as the same value is substituted 
to both tiie target block and the prediction block, and tiie 
function or substitute pixel value is known on the receiv- 
ing side. 

Rg. 10 is a block diagram of an image decoder 12 
used in any one of digital Image decoding and synthe- 
sizing apparatuses of Figs. 11, 13-1 7A. The data input 
to this decoder 1 2 is generated by the method described 
above. More specifically, an input image defined by a 
luminance signal expressing the pixel luminance value, 
and a transmittance signal expressing the pixel transpar- 
ency value (state), is divided into plural target Omage) 
blocks. The luminance and transmittance signals of each 
prediction block are then determined for the luminance 
and transmittance signals of the corresponding target 
block. For each target block containing at least one pixel 
value that should not be coded, a substitute pixel value 
is calculated by applying a known function to the predic- 
tion block pixel values to be coded. This substitute pixel 
value is tiien substituted into tiie target block and the cor- 
responding prediction block for each pixel value therein 
not to be coded, and the difference between the target 
block and the prediction block is obtained using the sub- 
stitute pixel values to code the image. The Image decod- 
ing apparatus shown in Fig. 10 and described below 
decodes the image data thus coded. 

As shown in Fig. 1 0, the decoder 1 2 preferably com- 
prises an input terminal 1 0, a parser 1 1 . an inverse quan- 
tizer 13, an inverse DOT operator 15, an adder 17. a 
frame memory bank 324. a prediction block generator 
(address generator) 21 , a padding value generator 321 , 
and a pad 319. The parser 1 1 , Inverse quantizer 1 3, and 
inverse DOT operator 15 defines main part of the 
decoder. 

The image data is input to the input terminal 1 0, and 
decoded by the decoder to generate the difference sig- 
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nal. More specifically, the parser 1 1 parses the image 
data, and outputs the motion information over line 130 
to address generator 21. and the quantization amount 
and quantization DOT coefficient over line 116 to the 
inverse quantizer 1 3. The inverse quantizer 1 3 multiplies 5 
the quantization DCT coefficient by tfie quantization 
amount to generate the inverse quantization DCT coef- 
ficient. 

The inverse quantization DCT coefficient is sent 
over line 1 1 8 to the inverse DCT operator 1 5 for Inverse 10 
conversion to a spatial domain difference signal. The dif- 
ference signal is sent over line 120 to the adder 17 for 
combination witii the prediction signal input from line 1 29 
to generate and output the reconstructed signal. This 
reconstructed signal (luminance and transmrttance sig- is 
nals) is also fed back to tiie frame memory bank 324 over 
line 124. The operation of the frame memory bank 324 
is described below. 

The address generator 21 converts the supplied 
motion information to the address used to access tiie 20 
frame memory bank 324, and the prediction signal block 
is read from tiiat address in the frame memory bank 324. 
The prediction block is then input over line 126 to the 
padding value generator 321. which applies a known 
function to the pixel values that should be coded in the 2S 
prediction block to generate the substitute pixel value. 
Note that the function used here must be tiie same as 
the function used on tiie ti^nsmission side. The preferred 
substitute pixel value is the average of the pixel values 
that should be coded. 30 

The substitute pixel value and the prediction block 
are then input to the pad 319, which replaces the pixel 
values not to be coded in the prediction block with the 
substitute pixel value, and then outputs over line 129 to 
the adder 17. 35 

The image objects thus reproduced can then be syn- 
thesized based on the transmittance information, and 
displayed. This process is described below with refer- 
ence to the following embodiments. 

Multiplexing must be used to compression code and 40 
transfer an image comprising plural layers as shown in 
Fig. 2. The digital image transfer operation is described 
below with reference to Figs. 12A, 12B and 12C. 

Rg. 12A shows the multiplexed structure of coded 
data comprising three layers (data 1 0 1 0 for layer A. data 4S 
1 0 1 1 for layer B. and data 1 0 1 2 for layer C) defining one 
image frame (frame 1). Data 1010 for layer A, data 101 1 
for layer B, and data 1 0 1 2 for layer C correspond to image 
A 1000, image B 1001. and image C 1002 shown in Fig. 
2. respectively. The data structure shown in Fig. 12A is so 
therefore obtained by multiplexing the image layers in 
sequence from the foreground layer. A variation of this 
structure would be to multiplex the layers in sequence 
from tiie bad^round layer, i.e., data 1023 for layer C, 
data 1 024 for layer B, and data 1 025 for layer A, as shown ss 
in Fig. 12C. 

The layer data starts witii a layer synchronization 
signal 1015, followed by a block synchronization signal 
1014. the motion information 1016 for that block, the 
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quantization amount 1017, and ttie DCT coefficient 
1 01 8. as shown in Rg. 1 2B. When a transmittance signal 
is present, the transmittance signal coefficient is also 
included in addition to the luminance signal DCT coeffi- 
cient. 

it is to be noted that while the preceding embodi- 
ments of the invention have been described using motion 
compensation DCT, sub-band, wavelet, or fractal coding 
may be alternatively used. In addition, multiplexing by 
image layer has been described above, but the layer data 
may also be divided into blocks of a known bit size, and 
the blocks tiien packetized for multiplexing. In tiiis case, 
the data may be multiplexed in a sequence such as layer 
A packet 1 . layer B packet 1 , layer C racket 1 . layer A 
packet 2, layer B packet 2. layer C packet 2.., 

The decoding and synthesizing apparatus accord- 
ing to the first embodiment of tiie present invention is 
described next witii reference to the block diagram in Rg. 
1 1 . The coded data multiplexed using the format shown 
in eitiier Fig. 12A or 12C is input to external line input 
terminal 10. The decoder 12 tfien decodes the coded 
data in each layer to a reconstructed image, which is 
stored to the frame memory FMO 1 4. The image synthe- 
sizer 16 overlays the reconstructed image with the syn- 
thesized image stored to the feedback frame memory 
FM1 22 to generate a new synthesized image. This new 
synthesized image is then stored to the frame memory 
FM1 22. and to the display memory 1 8 for display by the 
output device 20. Process controller 24 controls both the 
frame memory FMO 14 and the frame memory FM1 22. 

The preferred embodiment of the decoder 12 is 
shown in Fig. 10, and is described below for the case in 
which the coded data multiplexed as shown in Fig. 12C 
is decoded and syntiiesized. 

The coded data 1023 for layer C (tiie background 
layer) is first input to the external line input terminal 10. 
The parser 11 tiien parses the image data, and outputs 
the motion information over line 130 to address genera- 
tor 21. and the quantization amount and quantization 
DCT coefficient over line 1 1 6 to the inverse quantizer 1 3. 
The inverse quantizer 1 3 multiplies the quantization DCT 
coefficient by he quantization amount to generate the 
inverse quantization DCT coefficient. 

The inverse quantization DCT coefficient is sent 
over line 118 to the inverse DCT operator 15 for inverse 
conversion to a spatial domain difference signal. The dif- 
ference signal is sent over line 120 to tiie adder 17 for 
combination with the prediction signal (after pixel value 
substitution) input from line 129 to generate and output 
the reconstructed signal. This reconstructed signal is 
also fed back to the frame memory bank 324 over line 
124. The frame memory bank 324 comprises plural 
frame memory units. 

In this example, the frame memory t>ank 324 com- 
prises three frame memory units 19a. 19b. and 19c. 
Frame memory 19c is used to store the layer C image in 
this example. To read the layer C image from the frame 
memory bank 324, switch 80 is connected to terminal 
83c, switch 81 to terminal 84c. and switch 82 to terminal 
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85c. The address generator 21 converts the motion infor- 
mation to a frame memory address to read the prediction 
signal from the assigned frame memory 19c, which thus 
outputs layer C over line 126 to the padding value gen- 
erator 32 1 . 5 

The decoded layer C innage is then output to the 
frame memory FMO 14 in Rg. 11. Since layer C is the 
first layer in the frame in this example, the process con- 
troller 24 sets the frame memory FM1 22 to a known 
value. This known value indicates that the frame memory io 
value is not a valid pixel, and is preferably set to zero. 
The process controller 24 also outputs the pixel values 
for the same location from frame memories 1 4 and 22 to 
the image synthesizer 16 in the scanning sequence. 

The process executed by the image synthesizer 16 is 
is described next. The image stored to frame memory 
FMO 14 is drawn on top of the image written to frame 
memory FM1 22 because the image in frame memory 
FMO 14 is the image layer above the layer of the image 
in frame memory FM1 22. The image synthesizer 16 20 
compares the pixels input from the frame memory FMO 
14 with the corresponding pixels in the same position in 
the image input from the frame memory FM1 22. If the 
pixel value in the image from the frame memory FMO 1 4 
is not a particular value (zero in this example), the pixel 25 
from the frame memory FMO 14 is output; if the pixel 
value in the image from the frame memory FMO 14 is a 
particular value (zero in this example), the pixel from the 
frame memory FM1 22 is output. The output pixels are 
buffered to the display memory 1 8 for display to the out- 30 
put device 20, and simultaneously fed back to the frame 
memory FM1 22 for storage. 

The coded data 1024 of layer B in Fig. 12C is next 
input to the external line input terminal, decoded as 
described above to the reconstructed image by the 35 
decoder 12, and input to the frame memory FMO 14. 
Frame memory 19b in the decoder 12 is used to store 
the layer B image in this example. To read the layer B 
image from the frame memory bank 324, switches 80, 
81 , and 82 are therefore respectively switched to termi- 40 
nals 83b, 84b, and 85b, and the image layer is thus output 
and decoded. The images from frame memory FMO 14 
and frame memory FM1 22 are then overlaid by the 
image synthesizer 16 as described above, and output to 
the display memory 18 and frame memory FM1 22. As 4S 
a result, the layer B image is overlaid to the layer C 
image. 

The coded data 1024 of layer A in Fig. 12C is next 
input to the external line input terminal, decoded as 
described above by the decoder 1 2. and overlaid to the so 
image from frame memory FM1 22 by the image synthe- 
sizer 16. Frame memory 19a in the decoder 12 is used 
to store the layer A image in this example. To read the 
layer A image from the frame memory bank 324. 
switches 80, 81, and 82 are therefore respectively ss 
switched to terminals 83a, 84a, and 85a. The image for 
layer A is thus output and decoded, and can be overlaid 
on top of the layer B and C images for display. 
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This process is then repeated for the next frame. It 
is to be noted that while tiie present embodiment has 
been described using three layers, the same essential 
process can be applied to any number of image layers. 
Note, further, that the number of frame memory units in 
the frame memory bank 324 of tiie decoder 1 2 must cor- 
respond to the number of image layers. 

When the image data is coded using a format as 
shown in Rg. 12A, the process executed by the image 
syntiiesizer 16 must be modified. Specifically, tiie image 
layers stored to the frame memory FM1 22 are tiie 
images to be overlaid on top of the images stored to the 
frame memory FMO 1 4 because the image is multiplexed 
in sequence from the foreground layer. The image syn- 
thesizer 16 again compares the pixels input from the 
frame memory FMO 14 with the con-esponding pixels in 
the same position in the image input from the frame 
memory FM1 22. In this case, however, if the pixel value 
in the image from the frame memory FM1 22 is not a 
particular value (zero in this example), the pixel from the 
frame memory FM1 22 is output; if tiie pixel value in the 
image from the frame memory FM1 22 is a particular 
value (zero in tNs example), the pixel from the frame 
memory FMO 14 is output. Thus, the layer received last 
can be correctiy placed below the layer received first. 

Rg. 13 is a block diagram of a second embocfiment 
of a digital image decoding and synthesizing apparatus 
accofding to the present invention. This embodiment is 
essentially identical to that shown in Fig. 1 1 . What differs 
is that a switch 26 is inserted between the input terminal 
10 and the decoder 12, and a display selector 25 is 
added. 

The switch 26 is controlled by the process controller 
24 via line 113. When the image decoding and synthe- 
sizing apparatus is not reproducing and displaying a psu-- 
ticular layer, the switch 26 is open, thus interrupting the 
signal line between the input terminal 1 0 and the decoder 
12. The process controller 24 can control the switch 26 
to prevent processing a particular layer when there is not 
sufficient processing capacity available by evaluating the 
available processing capacity of the apparatus. Using 
tiie display selector 25, it is also possible for the user to 
control the operation of tiie switch 26 to reproduce and 
display only a particular layer. 

The display selector 25 may be a keyboard, mouse, 
or other interface device. To not display the background 
layer (layer C in Rg. 2), for example, the corresponding 
layer synchronization signal is detected by a layer syn- 
chronization signal detector (not shown in the figures), 
and the switch 26 is controlled to open when the syn- 
chronization signal for layer C is detected, thereby pre- 
venting layer C from being processed 

Rg. 14 is a block diagram of a third embodiment of 
a digital image decoding and synthesizing apparatus 
according to the present invention. This embodiment is 
essentially identical to that shown in Fig. 1 3. What differs 
is that another switch 34 is inserted between the image 
synthesizer 16 and the frame memory FM1 22, and a 
switch, 30 is inserted between the image synthesizer 16 
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and the display memory 18. Both of these switches 34 
and 30 are controlled by the process controller 24, and 
can be interactively controlled by the user using the dis- 
play selector 25. 

When the image stored to the frame memory FM1 s 
22 is not to be updated, the process controller 24 opens 
the switch 34 to disconnect the image synthesizer 16 
from the frame memory FM1 22. Ukewise, when the 
image synthesizer output is not to be displayed, the cor- 
responding switch 30 is opened to disconnect the image io 
synthesizer 1 6 from the display memory 1 8. 

This process is described below for the case in 
which image data multiplexed with layers A (foreground). 
6 (middle layer), and C (background) is input and repro- 
duced and displayed for a period of time because the is 
three switches 26. 30, and 34 are closed. The user then 
operates the display selector 25 to bring layer B to the 
foreground while freezing the display of layers A and C. 

Fig. 18 shows a table indicating the different posi- 
tions of each switch and the corresponding change in the 20 
content of the feedback frame memory FM1 22 and the 
display memory 18 for the case in which the coded data 
is sequentially input from the bad^round layer, i.e.. is 
input and decoded in the sequence CI , B1 , A1 , C2, B2, 
A2, ... Note that in Fig. 1 8, ON represents a closed switch 25 
position (continuity exists), and OFF represents an open 
switch position (continuity does not exist). The display 
selection signal is assumed to be input through the dis- 
play selector 25 at the beginning of frame 2. 

Both switches 26 and 34 are closed when frame 1 30 
is input, allowing the content of frame memory FM1 22 
to be continuously updated and resulting in a synthe- 
sized image of layers A1 + B1 + CI . Switch 30 is open 
until the synthesized image of layers A1 + B1 + CI is 
output, at which time switch 30 closes to update the 35 
image comprising layers AO + BO + CO stored in the dis- 
play memory 18 to be updated to the image comprising 
layers A1 + B1 + CI . The image from frame 1 continues 
to be displayed during the period for frame 2 because 
the desired image (frame 2 image) is still being proc- 4o 
essed. As a result, the switch 30 remains open and the 
display memory 1 8 continues to store the frame 1 image. 

In order to synthesize an image from layers C2 and 
A2, switches 26 and 34 are held closed while layers C2 
and A2 are input, but are opened when layer B2 is input 45 
More specifically, switches 26 and 34 are closed when 
layer C2 is input, opened when layer B2 is input, and then 
closed again when layer A2 is input. As a result, only lay- 
ers C2 and A2 are written to the frame memory FM1 22. 
Switch 34 is then opened so that the frame memory FM1 so 
22 is not updated and holds the image synthesized from 
layers A2 + C2. 

When frame 3 and subsequent frames are input, 
switches 26 and 34 are closed (ON) only during the 
period for layer B(B3, B4, ...). thereby enabling decoding, ss 
reconstruction, synthesis, and display of layer B with the 
synthesized image of layers A2+C2 after buffering to the 
display memory 1 8. It is thus possible to display layer B 
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in the foreground while freezing display of the other lay- 
ers in a still mode. 

Fig. 15 is a block diagram of a fourth embodiment of 
a digital image decoding and synthesizing apparatus 
according to the present invention. This embodiment is 
essentially identical to that shown in Fig. 1 4. What differs 
is that another frame memory FM2 23 is added to com- 
plement frame memory FM1 22, and switches 34 and 31 
are disposed to select the input and output of these two 
frame memory units. Another switch 38 is also provided 
to switch the input to the image synthesizer 16 between 
frame memory FMO 14 and frame memory FM2 23. 
These switches are controlled by the process controller 
24, and can be interactively controlled by the user using 
the display selector 25. 

This embodiment is described below with reference 
to a table in Fig. 1 9. As in the table of Fig. 1 8, ON repre- 
sents a closed switch position (continuity exists), and 
OFF represents an open switch position (continuity does 
not exist). In addition, "P" represents a terminal (switch 
pole) such that •'P40" shown in the line for switch 38 
means that switch 38 is connected to terminal (pole) 40 
(continuity exists through switch 38 to terminal 40). 

it is assumed that frame 1 is normally decoded and 
synthesized, i.e., layer B1 is overiaid to layer CI, and 
layer A1 to layer B1 . To achieve this, switch 26 is closed, 
and switches 38. 34, and 31 , respectively, are connected 
to terminals 40, 29, and 35. Switch 30 closes after the 
synthesized image of layers A1 + B1 + C1 is obtained to 
output tiie synthesized image through the display mem- 
ory 18 to the output device 20. 

A display selection signal instructing layer B to be 
displayed in the foreground while the other layers are 
also refreshed and displayed is then input from the dis- 
play selector 25 at the beginning of frame 2. Because 
each layer Is to be reproduced and displayed, switch 26 
remains closed. Both frame memories FM1 22 and FM2 
. 23 are set to a particular value at the beginning of frame 
1. 

Layer C2 is then decoded and output through switch 
38 from the frame memory FMO 1 4 to the image synthe- 
sizer 16. The output from the image synthesizer 16 is 
routed through switch 34 for storage to the frame mem- 
ory FM1 22. 

Layer B2 is then decoded and output through switch 
38 from the frame memory FMO 1 4 to the image synthe- 
sizer 16, combined thereby with the image held in the 
frame memory FM2 23 (which was previously set to a 
particular value), and then routed by switch 34 for stor- 
age to the frame memory FM2 23. 

Layer A is then similarly decoded, but is synthesized 
with the layer C2 image stored to the frame memory FM1 
22. The selector switch 31 is therefore switched to termi- 
nal 35, and the synthesized image is routed through 
switch 34 for storage to the frame memory FM1 22. As 
a result, the synthesized image A2 + C2 is stored to 
frame memory FM1 22, and the layer B2 image is stored 
to frame memory FM2 23. Switches 38 and 31 are then 
switched to terminals 39 and 35, respectively, to synthe- 
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size layer B2 with layers A2 + C2 for display By repeating 
this process for each layer in the following frames, a 
motion picture in which layer B is displayed in the fore- 
ground can be obtained. 

While the above description refers specifically to s 
coded data multiplexed in sequence from the back- 
ground layer, the same essential process is used to proc- 
ess coded data multiplexed in sequence from the 
foreground layer. 

Fig. 1 6 is a block diagram of a fifth embodiment of a io 
digital image decoding and synthesizing apparatus 
according to the present invention. This embodiment is 
essentially identical to that shown in Fig. 1 1 . What differs 
is that another frame memory FM3 42 is added for 
decoding and synthesizing coded data containing a is 
transmittance signal. In the example below, the coded 
data to be decoded and synthesized is multiplexed in 
sequence from the background layer as shown in Fig. 
120. The basic operation is as described above with 
respect to Rg. 1 1 , except that the luminance and trans- 20 
mittance signals of the reconstructed image in each layer 
are stored to the frame memory FM0 1 4 and frame mem- 
ory FM3 42, respectively, and then merged with the syn- 
thesized image supplied from the feedback frame 
memory FM1 22. 2s 

Using the transmittance signal stored to the frame 
memory FM3 42, the image synthesizer 16 merges the 
image in the frame memory FMO 14 with the image in 
the frame memory FM1 22. If xO is the pixel value from 
the frame memory FMO 1 4, x1 is the pixel value from the 30 
frame memory FM1 22, and a is the value from the frame 
memory FM3 42, an opaque object is present and the 
background cannot be seen when a is 100%; when a is 
0%, there is no object present, and the background is 
displayed through the other layers. The synthesized pixel 35 
value y can thus be obtained from the equation 

y = a*xO + (1 - a) *x1. 

It is thus possible to place the foreground image received 4o 
after the other layers over the other layers, and when a 
is between 0% and 100%, the transparency of the cor- 
responding objects can be appropriately adjusted. Note 
that a selective display capability can be provided in the 
apparatus shown in Fig. 16 by adding switches and 45 
frame memory as shown in Rgs. 13, 14, and 15 and 
described above. 

Fig. 17A is a block diagram of a sixth embodiment 
of a digital image decoding and synthesizing apparatus 
according to the present invention. Fig. 17A is a digital so 
image decoding and synthesizing apparatus for decod- 
ing and synthesizing coded data containing a transmit- 
tance signal as described above with reference to Rg. 
1 6. In this case, however, the coded data to be decoded 
and synthesized is multiplexed in sequence from the ss 
foreground layer as shown in Fig. 12 A. An image synthe- 
sizer 46 for synthesizing the transmittance signals stored 
to the frame memory FM3 42. and an additional frame 



memory FM4 44 for storing the result of transmittance 
signal synthesis, are also provided. 

The basic process executed by this apparatus is the 
same as that described above with reference to Fig. 16, 
except that the processes executed by the image syn- 
thesizers 16 and 46 is modified because the images are 
decoded in sequence from the foreground layer. 

The luminance and transmittance signals for layer A 
are first decoded, and stored to the frame memory FMO 
14 and frame memory FM3 42, respectively. Layer A is 
not synthesized because it is the first layer in the frame, 
and the luminance and transmittance signals are there- 
fore stored to the frame memory FM1 22 and frame 
memory FM4 44. A transmittance signal A* as shown in 
Fig. 17B is stored to the frame memory FM4 44. 

Layer B is then decoded, and the luminance and 
transmittance signals are stored to the frame memory 
FMO 14 and frame memory FM3 42. Using the transmit- 
tance signal (image 1004) stored in frame memory FM4 
44, the image synthesizer 16 combines the luminance 
signals from frame memory FMO 14 and frame memory 
FM1 22. If xO is the pixel value from the frame memory 
FMO 14. x1 is the pixel value from the frame memory 
FM1 22, and a is the value from the frame memory FM4 
44. x1 is placed over xO, and the output pixel Y from the 
image synthesizer 1 6 can be obtained from the equation 

y = a * x1 + (1 - a) * xO. 

Transmittance signal B' 1005 (Fig. 17B) is stored to 
the frame memory FM3 42, and is merged with the trans- 
mittance signal 1004 from the frame memory FM4 44 by 
the image synthesizer 46. The transmittance signals 
from the two frame memories FM3 42 and FM4 44 are 
preferably compared, and the signal with the higher sig- 
nal value is output. The transmittance signals thus syn- 
thesized are shown as image D* 1006 in Fig. 17B. 

The luminance and transmittance signals for layer C 
are decoded next, and merged with layers A and B as 
described above. 

Note that a selective display capability can be pro- 
vided in the apparatus shown in Fig. 17B by adding 
switches and frame memory as shown in Figs. 13, 14, 
and 15 and described above. 

The invention being thus described, it will be obvious 
that the same may be varied in many ways. Such varia- 
tions are not to be regarded as a departure from the spirit 
and scope of the invention, and a\\ such modifications as 
would be obvious to one skilled in the art are Intended 
to be included within the scope of the following claims. 

Claims 

1. An image coding method for coding a target image 
in a sequence of images, each image comprising a 
luminance signal expressing each pixel value, and 
a transmittance signal expressing the transparency 
of each pixel value, said method comprising the 
steps of: 



23 



EP0 719 049 A2 



24 



preparing a tempiate which is a representa- 
tive of the images in sequence; 

generating a first predictive image by predict- 
ing said target image based on said template, and 
producing first data related to said first predictive 5 
image; 

producing a difference between said first pre- 
dictive image and said target image; 

generating a second predictive image by pre- 
dicting said target image based on a first adjacent 10 
image which is to be displayed chronologically adja- 
cent on one side of said target image, and producing 
second data related to said second predictive 
image; 

producing a difference between said second is 
predictive image and said target image; and 

selecting one of said first and second data 
that has produced the smallest difference with 
respect to the target image. 

20 

2. An image coding method according to Claim 1, 
where in said first adjacent image is chronologically 
before said target image. 

3. An image coding method according to Claim 1. 2s 
where in said first adjacent image is chronologically 
after said target image. 

4. An image coding method according to Claim 1. fur- 
ther comprising the steps of: 30 

generating a third predictive image by pre- 
dicting said target image based on a second adja- 
cent image which is to be displayed chronologically 
adjacent on the other side, opposite to said one side, 
of said target image, and producing third data 3s 
related to said third predictive image; 

produdng a difference between said third 
predictive image and said target image; and 

said selecting step selecting one of said first, 
second and third data that has produced the small- 40 
est difference with respect to the target image. 

5. An image coding method according to Claim 1. fur- 
ther comprising the steps of: 

generating an average predictive image by 4S 
obtaining a weighted average of said first and sec- 
ond predictive images, and producing an averaged 
data related to said average predictive image; 

producing a difference between said average 
predictive image and said target image; and so 

said selecting step selecting one of said first, 
second and average data that has produced the 
smallest difference with respect to the target image. 

6. An image coding method according to Claim 4. fur- ss 
ther comprising the steps of: 

generating an average predictive image by 
obtaining a weighted average of at least two of said 
predictive images, and producing an averaged data 



related to said average predictive image; 

producing a difference between said average 
predictive image and said target image; and 

said selecting step selecting one of said first, 
second, third and average data that has produced 
the smallest difference with respect to the target 
image. 

7. An image coding method according to Claim 6, 
wherein said at least two of said predictive images 
are said second and third predictive images. 

8. An image coding method according to Claim 1, 
wherein each of said predictive image is one frame 
image. 

9. An image coding method according to Claim 1, 
wherein each of said predictive image is a portion of 
one frame image. 

10. An image predictive coding apparatus comprising: 

an input means to which an input image is 
input, said input image comprising a luminance sig- 
nal expressing each pixel value, and a transmittance 
signal expressing the transparency of each pixel 
value, 

a predictor for generating a predictive image; 

a first adder for generating a difference signal 
from the input image and the predictive image. 

a coder for compression coding said differ- 
ence signal, and outputting a compression coded 
image; 

a local decoder for decoding said compres- 
sion coded image to reproduce said difference sig- 
nal; 

a second adder for adding the decoded dif- 
ference signal to the predictive image to regenerate 
the reconstructed image; 

a first frame memory to which the recon- 
structed image is stored as a template; and 

a second frame memory to which the recon- 
structed image is stored as an adjacent Image which 
is adjacent to said input image; 

said predictor generating a predictive image, 
using the image stored in the first image memory, 
when the input image is the template, and generat- 
ing a predictive image, using the image stored in the 
first and second frame memories, when the input 
image is the adjacent image. 

11. An image coding apparatus for coding a target 
image in a sequence of images, each image com- 
prising a luminance signal expressing each pixel 
value, and a transmittance signal expressing the 
transparency of each pixel value, said apparatus 
comprising: 

means for preparing a template which is a 
representative of the images in sequence; 

means for generating a first predictive image 
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by predicting said target image based on said tem- 
plate, and producing first data related to said first 
predictive image; 

means for producing a difference between 
said first predictive image and said target image; 5 

means for generating a second predictive 
image by predicting said target image based on a 
first adjacent image which is to be displayed chron- 
ologically adjacent on one side of said target image, 
and producing second data related to said second 10 
predictive image; 

means for producing a difference between 
said second predictive image and SeUd target image; 
and 

means for selecting one of said first and sec- is 
ond data that has produced the smallest difference 
with respect to the target image. 

12. An image coding apparatus according to Claim 11, 
where in said first adjacent image is chronologically 20 
before said target image. 

13. An image coding apparatus according to Claim 1 1 , 
where in said first adjacent Image is chronologically 
after said target image. 2s 

14. An image coding apparatus according to Claim 11. 
further comprising: 

means for generating a third predictive image 
by predicting said target image based on a second 30 
adjacent image which is to be displayed chronolog- 
ically adjacent on the other side, opposite to said one 
side, of said target image, and produdng third data 
related to said third predictive image; 

means for producing a difference between 35 
said third predictive image and said target image; 
and 

said selecting means selecting one of said 
first, second and tiiird data that has produced the 
smallest difference with respect to the target image. 40 

15. An image coding apparatus according to Claim 1 1 , 
further comprising: 

means for generating an average predictive 
image by obtaining a weighted average of said first 4S 
and second predictive images, and producing an 
averaged data related to said average predictive 
image; 

means for producing a difference between 
said average predictive image and said target so 
image; and 

said selecting means selecting one of said 
first, second and average data that has produced the 
smallest difference with respect to tiie target image. 

55 

16. An image coding apparatus according to Claim 14, 
further conrprising: 

means for generating an average predictive 
image by obtaining a weighted average of at least 



two of said predictive images, and producing an 
averaged data related to said average predictive 
image; 

means for producing a difference between 
said average predictive image and said target 
image; and 

said selecting means selecting one of said 
first, second, third and average data that has pro- 
duced the smallest difference With respectto the tar- 
get image. 

17. An image coding apparatus according to Claim 16, 
wherein said at least two of said predictive images 
are said second and tiiird predictive images. 

18. An image coding apparatus according to Claim 1 1 , 
wherein each of said predictive image is one frame 
image. 

19. An image coding apparatus according to Claim 1 1 , 
wherein each of said predictive image is a portion of 
one frame image. 

20. A digital image transmission method comprising the 
steps of: 

preparing a plurality of templates represent- 
ative of a digitized image sequence; 

processing said templates separately to 
make compression coded data as template coded 
data, and then reproduced as reproduced tem- 
plates; 

producing coded motion picture data of a 
sequence images through compression process 
using one of said templates; and 

transmitting said coded motion picture data 
after first transmitting all of tiie template coded data. 

21. A digital image transmission method according to 
claim 20, wherein one template coded data and 
coded motion picture data that uses said one tem- 
plate are transmitted in a group. 

22. An image predictive coding method for predictive 
coding images in a sequence of images, each image 
comprising a luminance signal expressing each 
pixel value, and a transmittance signal expressing 
the ti-ansparency of each pixel value, said method 
comprising the steps of: 

dividing into a plurality of target subareas the 
entire area of each target image identified for coding 
by the corresponding image transmittance signal. 

determining the luminance signal and trans- 
mittance signal of the predictive subarea relative to 
the luminance signal and transmittance signal of the 
target subarea, 

calculating a substitute pixel value for each 
target subarea containing at least one pixel value not 
to be coded; 

generating an optimum predictive subarea by 
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substituting the substitute pixel value into the predic- 
tive subarea and the target subarea for the pixel val- 
ues therein not to be coded, and 

obtaining and coding the difference between 
the target subarea and the optimum predictive sub- 5 
area. 

23. An image predictive coding method according to 
Claim 22 wherein the calculated substitute pixel 
value is an average of the pixel value to be coded 10 
and contained in the predictive subarea. 

24. An image predictive coding apparatus comprising: 

an input terminal for receiving an input image 
comprising a luminance signal expressing each is 
pixel value, and a transmittance signal expressing 
the transparency of each pixel value, 

a first image block generator for dividing the 
input image into a target block. 

a predictor for generating a predictive image 20 
from the image stored in a frame memory, 

a second image block generator for dividing 
tiie predictive image into prediction blocks, 

a padding value generator, upon receipt of 
the prediction blocks, for calculating the substitute 2s 
pixel value based on the pixel values to be coded in 
the prediction block, and outputting a calculated 
substitute pixel value; 

a first pixel value padding means, upon 
receipt of said substitute pixel value and said target 30 
block, for substituting the substitute pixel value into 
the target block for the pixel values therein not to be 
coded; 

a second pixel value padding means for 
receiving the substitute pixel value and prediction 35 
block, and for substituting the sut}stitute pixel value 
into the prediction block for the pixel values tiierein 
not to be coded, 

a first adder for generating a difference signal 
from the target and prediction blocks in which the 40 
substitute pixel value has been substituted, 

a coder for obtaining compression codes of 
said difference signal and outputting a compression 
coded image data; 

a local decoder for decoding and reproducing 4S 
the difference signal; 

a second adder for adding the decoded dif- 
ference signal to the prediction block to which tiie 
substitute pixel value has been substituted to regen- 
erate the reconstructed image, and so 

a frame memory for storing the reconstructed 

image. 

25. A decoding apparatus for decoding image data 
coded by dividing an input image comprising a lumi- ss 
nance signal expressing each pixel value, and a 
transmittance signal expressing the transparency of 
each pixel value, into plural target blocks, determin- 
ing the luminance signal and transmittance signal of 



tiie predictive block relative to the luminance signal 
and transmittance signal of tiie target block, calcu- 
lating a substitute pixel value for each target block 
containing at least one pixel value not to be coded 
based on the pixel values to be coded in tiie corre- 
sponding predictive blocK substituting the calcu- 
lated substitute pixel value into tiie predictive block 
and tiie target block for the pixel values therein not 
to be coded, and then obtaining and coding the dif- 
ference between the target block and the predictive 
block, comprising: 

an input terminal receiving said image data; 

a decoder for generating a difference signal; 

a frame memory bank, 

a prediction block generator for generating a 
prediction block using the image input from the 
frame memory bank. 

a padding value generator for receiving the 
prediction block, and for generating a sutdstitute pixel 
value based on the pixel values to be coded in tiie 
prediction block. 

a pixel value padding means, upon receipt of 
tiie substitute pixel value and the prediction block, 
for substituting the substitute pixel value into the pre- 
diction block for the pixel values therein not to be 
coded, 

an adder for adding the substituted prediction 
block containing the substitute pixel value to the dif- 
ference signal to generate a reconstructed image, 
and outputting the reconstructed image and stores 
tiie reconstructed image to the frame memory bank. 

26. A digital image transmission method for transmitting 
a digital image comprising the steps of: 

separating an image to a plurality of image 

layers; 

processing compression coding to each of 
the plural image layers; 

multiplexing the layers in a particular 
sequence; and 

transmitting the multiplexed layers. 

27. A digital image transmission method according to 
daim 26, wherein said plurality of layers comprises 
a background layer and a foreground layer. 

28. A digital image transmission method according to 
claim 27. wherein said layers are multiplexed in 
sequence starting from tiie background layer. 

29. A digital image transmission method according to 
daim 27. wherein said layers are multiplexed in 
sequence starting from tiie foreground layer. 

30. An apparatus for decoding and synthesizing digital 
images comprising a plurality of combined image 
layers coded by separately conrpression coding 
each of the plurality of image layers, and multiplexing 
the layers in sequence starting from the background 
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layer, comprising: 

an external line input terminal, 

a decoder, 

a synthesizer, and 

a frame memory, 5 
said decoder decoding each layer of the 
coded data input to the external line Input terminal 
to a reconstructed image in the multiplexed 
sequence; 

said synthesizer receiving the reconstructed io 
image and a previous synthesized image supplied 
from the frame memory, and combining the recon- 
structed image and the previous synthesized image 
to generate a new synthesized image; and 

said frame memory storing said new synthe- is 
sized image. 

31. A digital image decoding and synthesizing appara- 
tus according to Claim 30, further comprising: 

a first selector switch between the external 20 
line input terminal and the decoder, said first selector 
switch being controlled to not connect the external 
line input terminal and the decoder when an image 
layer that is not to be reproduced is Input. 

25 

32. A digital image decoding and synthesizing appara- 
tus according to Claim 31 , further conrprising: 

a second selector switch disposed between 
the synthesizer and the frame memory, said second 
selector switch being controlled to not connect the 30 
synthesizer and the frame memory when the image 
stored to the frame memory is not to be updated. 

33. A digital image decoding and synthesizing appara- 
tus according to Claim 30. further comprising: as 

a first frame memory; 
a second frame memory; 
a first switch; 
a second switch; 

a third switch; 40 
said synthesizer having a first input terminal 
and a second input terminal, said first input terminal 
of the synthesizer connected to either one of the 
decoder and the second frame memory by said first 
switch, and said second input terminal of the syn- 4S 
thesizer connected to ether one of the first frame 
memory and the second frame memory by said sec- 
ond switch, and 

said synthesizer producing a synthesized 
image which is stored to either one of the first and so 
second frame memories by said thiixl switch. 

34. A digital image decoding and synthesizing appara- 
tus according to Claim 30. wherein said coded data 
multiplexed in sequence from the background layer ss 
is replaced by data multiplexed in sequence from the 
foreground layer. 



35. A digital image decoding and synthesizing appara- 
tus according to Claim 30, wherein the images in 
each of the plurality of layers conprises a luminance 
signal expressing the pixel value, and a transmit- 
tance signal expressing the transparency of the pixel 
value. 

36. An apparatus for decoding and synthesizing digital 
images comprising a plurality of combined image 
layers coded by separately compression coding 
each of the plurality of image layers, and multiplexing 
the layers in sequence starting from the background 
layer, comprising: 

an external line input terminal, 

a decoder, 

a synthesizer, 

a frame memory, and 

an output device. 

wherein said decoder decodes each layer of 
the coded data input to the external line input termi- 
nal to the luminance signal and transmittance signal 
of the reconstructed image in the multiplexed 
sequence, 

said synthesizer receives the luminance sig- 
nal and transmittance signal of the reconstructed 
image, and the luminance signal of tiie synthesized 
image supplied from the frame memory, 

said synthesizer generates the luminance 
signal of a new synthesized image by overlaying the 
luminance signal of the reconstructed image to the 
luminance signal of tiie synthesized image at tiie 
ratio defined by the transmittance signal of tiie 
reconstructed image, and 

said synthesizer produces the luminance sig- 
nal of the new synthesized image for storage to the 
frame memory, and display to the output device. 

37. An apparatus for decoding and synthesizing digital 
images comprising a plurality of combined image 
layers coded by separately compression coding 
each of the plurality of image layers, and multiplexing 
the layers in sequence starting from the foreground 
layer, comprising: 

an extemal line input terminal, 

a decoder, 

a first synthesizer. 

a second synthesizer, 

a first frame memory. 

a second frame memory, and 

an output device. 

wherein said decoder decodes each layer of 
the coded data input to the external line input termi- 
nal to the luminance signal and transmittance signal 
of the reconstructed image in the multiplexed 
sequence; 

said second syntiiesizer receives the trans- 
mittance signal of the reconstructed image, and tiie 
transmittance signal of the synthesized image sup- 
plied from the second frame memory; 
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said second synthesizer generates the trans- 
mittance signal of a new synthesized image by over- 
laying the transmittance signal of the reconstructed 
image and the transmittance signal of the synthe- 
sized image, and 5 

said second frame memory stores the new 
synthesized image; 

said first synthesizer receives the luminance 
signal of the reconstructed image, and the lumi- 
nance signal of the synthesized image supplied from 10 
the first frame memory; 

said first synthesizer generates the lumi- 
nance signal of the new synthesized image by over- 
laying the luminance signal of the reconstructed 
image and the luminance signal of the synthesized 75 
image at the ratio defined by the transmittance sig- 
nal of the synthesized image supplied from the sec- 
ond frame memory and 

said first synthesizer produces the luminance 
signal of the new synthesized image for storage to 20 
the first frame memory, and for display to the output 
device. 
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